Getting started with Boomi DataHub
The Boomi DataHub is a cloud-based, flexible master data synchronization service that helps you keep valuable data domains, such as customer data, consistent, reliable, and accurate.
Hub is set up, configured, and managed from a web browser in a single-instance, multi-tenant environment. It synchronizes with Boomi Integration to connect to any combination of SaaS cloud, local, and hybrid environments. You can configure data sources to contribute quality data, receive it, or both, using integrations and the Boomi DataHub connector.
After you define the ideal record and deploy a data model, Hub identifies record elements that do not match your data quality standards. Hub maintains validated and up-to-date records, called golden records, and quarantines low-quality data for your review.
Multiple applications in your organization can coordinate and reference golden records to obtain consistent, high-quality, up-to-date information.
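As a rough illustration of the idea, and not of how Hub is implemented, the following Python sketch routes incoming records either to a golden-record store or to a quarantine list based on a simple quality check. The `Repository` class, the `REQUIRED_FIELDS` rule, the field names, and the `ingest` function are all hypothetical. In Hub, the quality rules come from your deployed data model.

```python
from dataclasses import dataclass, field

# Hypothetical quality rule: every record must carry these non-empty fields.
REQUIRED_FIELDS = ("customer_id", "name", "email")


@dataclass
class Repository:
    """Toy stand-in for a repository; this is not a Boomi API."""
    golden_records: dict = field(default_factory=dict)
    quarantine: list = field(default_factory=list)


def ingest(repo: Repository, record: dict) -> str:
    """Store a record as golden if it passes the check, otherwise quarantine it."""
    missing = [name for name in REQUIRED_FIELDS if not record.get(name)]
    if missing:
        reason = "missing: " + ", ".join(missing)
        repo.quarantine.append({"record": record, "reason": reason})
        return "quarantined"
    repo.golden_records[record["customer_id"]] = record
    return "golden"


repo = Repository()
status_ok = ingest(repo, {"customer_id": "C1", "name": "Ada", "email": "ada@example.com"})
status_bad = ingest(repo, {"customer_id": "C2", "name": "Bob", "email": ""})
print(status_ok, status_bad)
```

The quarantined entry keeps the original record together with a reason, which mirrors the idea that low-quality data is held for review rather than discarded.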
Boomi DataHub lifecycle
There are four core data management activities in Hub:
| Activity | Description |
| --- | --- |
| Define | Define the characteristics and criteria for data models in a domain. |
| Deploy | Deploy models to a Hub repository and identify the source systems that will interact with them. |
| Synchronize | Leverage Integration to orchestrate data synchronization and design process flows to ensure data quality. |
| Steward | Steward data as it flows into domains to resolve duplicates and fix data entry issues, as well as identify and correct inaccurate data. |
Enroll in the DataHub Essentials course to learn more about data management, data stewardship, and the lifecycle.
Boomi DataHub architecture
The Boomi Hub Cloud hosts your repositories, deployed models, and golden records. Sources use integrations to connect to the deployed model to contribute master data, access master data, or both. Models can reference data from other models in the same repository. For example, the Contact model can reference data from the customer ID field in the Customer model.
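The Contact-to-Customer reference can be pictured with a small Python sketch. The class names, field names, and the `resolve_customer` helper are hypothetical and only model the relationship. They do not reflect how Hub stores or resolves references internally.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Customer:
    customer_id: str
    name: str


@dataclass(frozen=True)
class Contact:
    contact_id: str
    email: str
    customer_id: str  # reference to Customer.customer_id in the same repository


def resolve_customer(contact: Contact, customers: Dict[str, Customer]) -> Optional[Customer]:
    """Follow the reference from a contact to its customer, if one exists."""
    return customers.get(contact.customer_id)


customers = {"C1": Customer(customer_id="C1", name="Acme Corp")}
contact = Contact(contact_id="K1", email="ada@example.com", customer_id="C1")
orphan = Contact(contact_id="K2", email="bob@example.com", customer_id="C9")

print(resolve_customer(contact, customers))
print(resolve_customer(orphan, customers))
```

A contact whose `customer_id` has no matching customer resolves to `None`, which is the kind of dangling reference you would want to catch in a development or test repository before it reaches production.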
Boomi recommends you create a development repository, test repository, and production repository so you can develop and test the flow of data and prevent errors. Your production repository is the single source of truth for your business data.

Data management workflow: A quick start guide
There are eight steps to create a new data management project in Boomi DataHub.
Step 1: Create your repositories
Create a repository that will host your master data, models, and source configurations. Repositories are virtual runtimes for your validated, trusted data. The data in a repository is hosted in the Boomi Hub Cloud. By default, you can have up to three repositories.
Boomi recommends that you create the following three repositories to minimize the risk of errors to live master data:
- Development repository - use this repository to establish and update deployed models and source settings with a small amount of data. It allows developers to safely experiment with new models and updates.
- Test repository - use this repository with a larger amount of data to test connections and ensure data flows correctly between golden records and sources.
- Production repository - use this repository to contain the actual, live master data that is accessed by data users for business decisions.
Read Repositories overview and Creating a Repository to learn more.
Step 2: Create integrations in Boomi Integration
Create integrations that will flow data to and from sources using Boomi Integration and the Boomi DataHub connector. Read the following topics to help you:
- Building an integration process for an initial load
- Building an integration process to batch incoming updates
- Boomi DataHub connector
- Boomi DataHub APIs
Although you can use the Boomi DataHub APIs to build integrations between your sources and repository, using Integration simplifies the process because it:
- Does not require coding to build integrations
- Contains built-in tools to deploy and manage processes
- Allows you to use the Boomi DataHub connector, which handles the technical aspects of exchanging data between sources and repositories
- Simplifies administration, because Integration and Boomi DataHub are interconnected
Step 3: Create sources in Boomi DataHub
Establish source connections that will contribute data to the repository, accept record updates, or both. Source applications can be local or cloud. Read Creating a source to learn more.
Step 4: Create a model
Create a model that defines the structure of golden records. Models contain rules to identify new records, identify record updates, and quarantine low-quality data. Read Creating a model to learn more.
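To make the role of a matching rule concrete, here is a minimal Python sketch that decides whether an incoming entity creates a new golden record or updates an existing one. Matching on a normalized `email` field is an assumption chosen for illustration, and `normalize` and `apply_entity` are hypothetical helpers. In Hub, you configure match rules in the model rather than writing code.

```python
def normalize(value: str) -> str:
    """Normalize a match value so that trivial differences do not block a match."""
    return value.strip().lower()


def apply_entity(golden: dict, entity: dict, match_field: str = "email") -> str:
    """Create, update, or quarantine based on a single-field match rule."""
    key = normalize(entity.get(match_field) or "")
    if not key:
        # Without a match value the entity cannot be linked to any record.
        return "quarantined"
    if key in golden:
        golden[key].update(entity)  # merge incoming fields into the existing record
        return "updated"
    golden[key] = dict(entity)
    return "created"


golden = {}
first = apply_entity(golden, {"email": "Ada@Example.com", "name": "Ada"})
second = apply_entity(golden, {"email": " ada@example.com", "phone": "555-0100"})
third = apply_entity(golden, {"name": "No Email"})
print(first, second, third)
```

The second call matches the first record despite the different casing and leading space, so the phone number is merged into the existing record instead of creating a duplicate.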
Step 5: Configure source settings in the model
You can specify how sources contribute and accept data. Source configurations automatically attach to any deployed model across repositories. Read Adding a source to a model to learn more.
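The contribute and accept settings can be thought of as two independent flags per source. The Python sketch below models that idea. The `SourceSettings` class, the source names, and the `sources_to_notify` helper are hypothetical, and skipping the source that originated a change is an assumption of this sketch, not a statement about Hub's behavior.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class SourceSettings:
    source_id: str
    contributes: bool       # source may send data to the repository
    accepts_updates: bool   # source may receive golden record updates


def sources_to_notify(settings: List[SourceSettings], origin: str) -> List[str]:
    """List sources that accept updates, skipping the one that sent the change."""
    return [
        s.source_id
        for s in settings
        if s.accepts_updates and s.source_id != origin
    ]


settings = [
    SourceSettings("crm", contributes=True, accepts_updates=True),
    SourceSettings("erp", contributes=True, accepts_updates=False),
    SourceSettings("warehouse", contributes=False, accepts_updates=True),
]

print(sources_to_notify(settings, origin="crm"))
```

Here `erp` only contributes and `warehouse` only receives, so a change that arrives from `crm` is forwarded to `warehouse` alone.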
Step 6: Publish and deploy the model to your repository
Hub uses your deployed model to load data from sources, create golden records, and maintain master data in your repository. Read Publishing a model and Deploying a model to a repository to learn more.
Step 7: Synchronize and load data from sources
Load data from sources into your repository. Read Loading data from a source to learn more.
Step 8: Steward data in golden records
View golden records and quarantined data. Read Viewing domain data and Viewing a domain’s quarantine entries to learn more.

